Improve resilience and performance of do_bulk_inference #128

mhaas · 2022-05-19T16:20:26Z

In case of errors, the InferenceClient.do_bulk_inference method
will now return None for the affected objects instead of aborting
the entire bulk inference operation (and discarding any successfully
processed objects).

Fixes issue #68

The fix for #68 differs from what is described in #68. Instead of
using a generator-based approach, which would require the SDK consumer to
implement the error handling themselves, the SDK itself now handles the
errors. The downside of not using a generator is a larger memory footprint,
because all results are accumulated in a list. As an alternative, we could
use a generator that yields either the successfully processed inference
results or the list containing None. That approach would save memory.
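For comparison, the generator alternative mentioned above could look roughly like the sketch below. It is not the code in this PR: `iter_bulk_results` and the `infer_batch` callable are hypothetical names used only for illustration, and the policy of yielding one `None` per object of a failed batch is an assumption.

```python
from typing import Callable, Iterable, Iterator, List, Optional


def iter_bulk_results(
    batches: Iterable[List[dict]],
    infer_batch: Callable[[List[dict]], List[dict]],
) -> Iterator[Optional[dict]]:
    """Yield one result per input object, or None for objects in a failed batch.

    `infer_batch` is a hypothetical stand-in for a single inference request.
    """
    for batch in batches:
        try:
            results = infer_batch(batch)
        except Exception:
            # Assumed policy: a failed batch produces one None per object,
            # so the output stays aligned with the input.
            results = [None] * len(batch)
        # Results are handed out batch by batch; nothing is accumulated.
        yield from results
```

Because only one batch of results is alive at a time, memory use stays bounded by the batch size, at the cost of processing the batches sequentially.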

Additionally, this commit introduces parallel processing in InferenceClient.do_bulk_inference.
This will greatly improve performance. Due to the non-lazy implementation of
ThreadPoolExecutor.map, this increases memory usage slightly (cpython issue python/cpython#74028).
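The combination of per-batch error handling and parallel processing described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the SDK's implementation: `bulk_infer`, `infer_batch`, `batch_size`, and `max_workers` are hypothetical names and defaults.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List, Optional


def bulk_infer(
    objects: List[dict],
    infer_batch: Callable[[List[dict]], List[dict]],
    batch_size: int = 50,  # assumed value, for illustration only
    max_workers: int = 4,  # assumed value, for illustration only
) -> List[Optional[dict]]:
    """Run `infer_batch` over all objects in parallel, one call per batch."""
    batches = [objects[i:i + batch_size] for i in range(0, len(objects), batch_size)]

    def safe_infer(batch: List[dict]) -> List[Optional[dict]]:
        try:
            return infer_batch(batch)
        except Exception:
            # A failed batch contributes None placeholders instead of
            # aborting the whole operation.
            return [None] * len(batch)

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Executor.map submits every batch up front (it is not lazy) and
        # returns the results in input order.
        per_batch = pool.map(safe_infer, batches)
        return [result for batch_results in per_batch for result in batch_results]
```

Since `Executor.map` schedules all batches immediately, every batch is held in memory at once, which is the slight increase in memory usage noted above.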


SreevishnuAB

Thanks!


This is mainly useful to fix the tests which rely on the mocks being called in a certain order. One of the tests supports concurrency by mocking in a better way, but this was not feasible for the other tests. This commit also updates the documentation build tools to the latest version to fix the documentation build on my local machine.

mhaas added 12 commits May 19, 2022 18:13

Merge remote-tracking branch 'origin/main' into bulk_inference_resilience

79c3fa1

fix: bulk inference error response should include object_id

b2934aa

This also contains a documentation formatting fix.

chore: update CHANGELOG.md

8d20964

chore: fix Python 3.5 compat

9ea705e

chore: attempt to fix build error on Python 3.7

4372b8f

Revert "chore: attempt to fix build error on Python 3.7"

67be9ce

This reverts commit 4372b8f.

chore: Fix build for Python 3.7

3c60dde

Fix build on Python 3.7, second attempt

136592e

setuptools must be installed much earlier in the process.

Only install setuptools on Python 3.7

456ca8d

Fix syntax

3d0432c

chore: make pytest output verbose

e9403e7

mhaas requested a review from SreevishnuAB June 17, 2022 07:59

SreevishnuAB approved these changes Jun 17, 2022


bulk_inference: fix tests on pypy

accc46b

The bulk_inference code is now multithreaded. For this reason, the trick of returning different values in the Mock based on the order of the calls no longer works. This was somewhat accidentally working on CPython, but not on pypy.
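One way to make such a mock independent of call order is to let it decide its response from the arguments it receives. The sketch below illustrates the idea only; it is not taken from the project's test suite, and `respond`, `mock_infer`, and the sample batches are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor
from unittest.mock import Mock


def respond(batch):
    # The response depends on the input, not on how many calls came before,
    # so the thread scheduling order does not matter.
    if batch[0] == "bad":
        raise RuntimeError("simulated failure")
    return [item.upper() for item in batch]


# A callable side_effect receives the same arguments as the mock itself.
mock_infer = Mock(side_effect=respond)


def safe(batch):
    try:
        return mock_infer(batch)
    except RuntimeError:
        return [None] * len(batch)


batches = [["a"], ["bad"], ["c"]]
with ThreadPoolExecutor(max_workers=3) as pool:
    results = list(pool.map(safe, batches))
```

By contrast, a `side_effect` given as a list of return values is consumed in call order, which is only deterministic when the calls happen sequentially.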

mhaas force-pushed the bulk_inference_resilience branch from 264db6f to accc46b June 17, 2022 12:55

mhaas added 2 commits June 17, 2022 15:00

fix: Python 3.5 has no f strings

f0d4169

mhaas force-pushed the bulk_inference_resilience branch from 31da951 to 5bbf4a2 June 17, 2022 15:36

mhaas merged commit 68cf17a into main Jun 17, 2022

mhaas deleted the bulk_inference_resilience branch June 17, 2022 15:55
